k-Adic Similarity Coefficients for Binary (Presence/Absence) Data

Author

  • Matthijs J. Warrens
Abstract

k-Adic formulations (for groups of objects of size k) of a variety of 2-adic similarity coefficients (for pairs of objects) for binary (presence/absence) data are presented. The formulations are not functions of 2-adic similarity coefficients. Instead, the main objective of the paper is to present k-adic formulations that reflect certain basic characteristics of, and have a similar interpretation as, their 2-adic versions. Two major classes are distinguished. The first class, referred to as Bennani-Heiser similarity coefficients, contains all coefficients that can be defined using just the matches (the number of attributes that are present in all k objects and the number that are absent in all k objects) and the total number of attributes. The coefficients in the second class can be formulated as functions of Dice's association indices.
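As a rough illustration of the first class, the sketch below counts the two kinds of matches for a group of k binary vectors and builds two coefficients from them. The function names, the variable names `a`, `d`, and `m`, and the particular formulas (a simple-matching-style ratio `(a + d) / m` and a Jaccard-style ratio `a / (m - d)`) are assumptions chosen to fit the description in the abstract; they are not taken from the paper itself.

```python
from typing import Sequence, Tuple


def kadic_counts(objects: Sequence[Sequence[int]]) -> Tuple[int, int, int]:
    """Return (a, d, m) for a group of k binary vectors.

    a = number of attributes present (1) in all k objects
    d = number of attributes absent (0) in all k objects
    m = total number of attributes
    These names are illustrative assumptions, not the paper's notation.
    """
    if not objects:
        raise ValueError("need at least one object")
    m = len(objects[0])
    if any(len(obj) != m for obj in objects):
        raise ValueError("all objects must have the same number of attributes")
    # zip(*objects) iterates over attributes (columns) across the k objects.
    a = sum(1 for column in zip(*objects) if all(column))
    d = sum(1 for column in zip(*objects) if not any(column))
    return a, d, m


def kadic_simple_matching(objects: Sequence[Sequence[int]]) -> float:
    """Share of attributes on which all k objects agree (assumed form)."""
    a, d, m = kadic_counts(objects)
    if m == 0:
        raise ValueError("coefficient is indeterminate with zero attributes")
    return (a + d) / m


def kadic_jaccard(objects: Sequence[Sequence[int]]) -> float:
    """Joint presences divided by attributes present in at least one object
    (assumed form)."""
    a, d, m = kadic_counts(objects)
    if m == d:
        raise ValueError("coefficient is indeterminate: no attribute is present")
    return a / (m - d)


# Hypothetical data: k = 3 objects scored on m = 5 attributes.
group = [
    [1, 1, 0, 0, 1],
    [1, 0, 0, 0, 1],
    [1, 1, 0, 1, 1],
]
print(kadic_counts(group))
print(kadic_simple_matching(group))
print(kadic_jaccard(group))
```

With k = 2 both functions reduce to the familiar pairwise simple matching and Jaccard coefficients, which is the kind of correspondence with the 2-adic versions that the abstract describes. Note that neither function is computed from pairwise coefficients: the counts are taken over the whole group at once.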

Similar resources

On the Indeterminacy of Resemblance Measures for Binary (Presence/Absence) Data

Many similarity coefficients for binary data are defined as fractions. For certain resemblance measures the denominator may become zero. If the denominator is zero the value of the coefficient is indeterminate. It is shown that the seriousness of the indeterminacy problem differs with the resemblance measures. Following Batagelj and Bren (1995) we remove the indeterminacies by defining appropri...

On the Coefficients of Binary Bent Functions

We prove a 2-adic inequality for the coefficients of binary bent functions in their polynomial representations. The 2-adic inequality implies a family of identities satisfied by the coefficients. The identities also lead to the discovery of some new affine invariants of Boolean functions on Z2 .

Privacy-preserving similarity coefficients for binary data

Similarity coefficients (also known as coefficients of association) are important measurement techniques used to quantify the extent to which objects resemble one another. Due to privacy concerns, the data owner might not want to participate in any similarity measurement if the original dataset will be revealed or could be derived from the final output. There are many different measurements use...

New Similarity Coefficients for Binary Data

In the last few decades, the use of similarity measures has been becoming more and more important due to the relevance of comparing samples in order to find out clusters of similar samples, to generate priority lists, and, in general, to discover patterns in data structures. In drug design, their relevance is already well established to search for the most suitable alternative to a target drug....

Genetic Diversity of Yellow and White Astragalus Populations in Protected Areas of Isfahan Province Assessed with ISSR Markers

Genetic variation of 16 white and yellow astragal accessions collected from three protected regions of Isfahan province (Mooteh, Kolah-Ghazi and Ghamishloo) were evaluated using ISSR marker. Nine ISSR primers produced 221 bands in which 204 were polymorphic among astragal accessions. ISSR banding patterns were transformed into binary data of presence–absence and matrices were processed with NTS...

Journal:
  • J. Classification

Volume 26, Issue 

Pages -

Publication date: 2009